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Cytotoxic cyplasin of the sea hare, Aplysia punctata, cDNA 
cloning and expression of bioreactive recombinants 

The present invention relates to a nucleic acid coding for a 
protein called "cyplasin" that shows a preferential toxicity 
to autonomously growing mammalian cells. Cell death induced by 
this protein differs from both apoptosis and necrosis. An 
intracellular cell death which occurs when recombinantly 
preparing cyplasin in cell cultures can be avoided by removal 
of the secretion signal in the cyplasin seguence. This 
modification makes it possible to express the cyplasin in a 
mammalian cell culture which is preferable with regard to the 
glycosylation pattern of the obtained protein. Thus, the 
present invention also relates to a method of recombinantly 
producing a protein in eukaryotic cells, preferably mammalian 
cells, which is cytotoxic for said cells when applied 
externally. - 

Marine organisms represent an essentially unexploited 
reservoir for genes and metabolic products of potential 
biological and/or pharmacological interest [1, 2, 3] . So far, 
literature on natural products derived from marine organisms 
is dominated by low molecular weight compounds characterized 
by cytotoxicity. A number of such natural drugs are either 
clinically applied or under evaluation as potential anticancer 
drugs [1, 2, 3] . In contrast, reports on exploitable genes 
from marine organisms and their products are rare. The green 
fluorescent protein from the jellyfish Aeguorea victoria may 
serve as an example for a gene of basic biological interest, 
which is widely used in biotechnology as reporter for studies 
on gene expression and protein localization in living cells 
[4] . 

Sea hares appear to represent another species producing high 
molecular weight gene products of interest. Originally, the 
toxicity of the mollusc Aplysia was found to be due to low 
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molecular weight metabolic substances deriving from algal diet 
[5]. However, cytolytic, antimicrobial and antifungal 
activities could be detected in biochemical isolates of high 
molecular weight from the sea hares Aplysia kurodai, Aplysia 
juliana and Dolabella auricular ia. Accordingly it was 
suggested that these organisms might produce water-soluble 
gene-expressed biopolymeres of pharmacological interest [5, 
6] . Furthermore, these biochemical investigations suggest that 
sea hares produce a number of closely related glycoproteins of 
different sizes and with different biological activities. 
First attempts to characterize these proteins on the sequence 
level led to the molecular cloning of one Aplysia kurodai- 
derived cDNA which showed significant sequence identities with 
the cDNA encoding a protein produced by the giant African 
snail Achatina fulica [7]. However, a clear correlation of the 
protein encoded by the cloned Aplysia kurodai cDNA with any 
biological activity is missing. This is most likely due to the 
fact that the biologically active molecules are glycoproteins 
and that recombinant expression in E. coli results in 
biologically inactive proteins. 

Thus, the technical problem underlying the present invention 
was to provide means for recombinantly producing cytotoxic 
proteins like cyplasin from Aplysia in a biologically active 
form. 

The solution to said technical problem is achieved by 
providing the embodiments characterized in the claims. 

The potential pharmacological value of Aplysia-derived 
proteins stimulated the approach of the present inventors to 
identify cytotoxic activities of the European sea hare Aplysia 
punctata on the sequence level. A bioassay-guided 

fractionation of the secreted mucus of albumen glands released 
a 56 kDa glycoprotein which showed cytotoxic effects on 
autonomously growing cells in nanomolar concentrations. Based 
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on its cytotoxicity, its possible effects on neoplasia and its 
origin Aplysia, the protein was termed cyplasin. Cyplasin 
shows a preferential toxicity to autonomously growing 
transformed mammalian cells. Cell death induced by this 
protein differs from both, apoptosis and necrosis. The 
cytotoxic effects are irreversible and become apparent, at 
nanomolar concentrations in a cell type-dependent manner. In 
contrast, injection of micromolar concentrations into mice is 
tolerated without apparent negative consequences. 
Micros equencing of the 56 kDa protein released a peptide 
sequence whose corresponding nucleotide sequence was used as 
probe to screen Aplysia punctata RNA-based cDNA and to select 
cDNA clones encoding polypeptides comprising the target 
peptide. Two closely related cDNAs were detected. The cDNA 
encoding a polypeptide 558 aa in length was considered to 
reflect a bona fide clone encoding the cytotoxic protein. Its 
protein-coding section was recloned in vectors suitable for 
expression in E. coli, in mammalian cells and in insect cells, 
respectively. The E. coli-expressed polypeptide was 
biologically inactive. Transfected mammalian cells expressed a 
cytotoxic factor and died thereof as if treated with the 
genuine cytotoxic protein.. In contrast, transfected insect 
cells which proved to be much less sensitive when treated with 
the genuine protein expressed the cytotoxic factor and. 
continued to proliferate allowing to establish stable insect 
cell lines expressing sufficient amounts of the cytotoxic 
factor for further characterization. Finally, it could be 
shown that a biologically active protein could be 
recombinantly produced in mammalian cells when using a DNA 
sequence encoding the protein without the secretory signal 
sequence. 



Figure legends 

Figure 1: SDS-polyacrylamide gel electrophoresis of cyplasin 
isolated by a bioassay-guided fractionation of the secreted 
mucus of A. punctata 

The figure shows a 12 % SDS polyacrylamide gel loaded with the 
most active fraction (lane cyplasin) . The proteinaceous 
material migrates with an apparent molecular mass of 56 kDa. 
Lane M is loaded with marker proteins. 

Figure 2; 

(a) Amino acid sequences of precursor proteins derived from A. 
punctata cDNAs comprising the nucleotide sub-sequences coding 
for the (underscored) internal peptide SGDYILIASYAD 

The upper sequence (558 aa residues) is derived from the 
nucleotide sequence of the cDNA encoding the polypeptide 
termed cyplasin-L, and the lower sequence (421 aa residues) is 
derived from the nucleotide sequence of the cDNA encoding the 
polypeptide termed cyplasin-S. The nucleotide sequences are 
found in databases under the accession numbers AJ304802 
(cyplasin-L cDNA) and AJ304801 (cyplasin-S cDNA) . In addition 
to these clearly distinguishable transcripts other mRNAs may 
exist with additional differences. PCR with total cDNA as 
template and cyplasin-L specific primer pairs releases 
sequences slightly differing from the cloned cyplasin-L and 
cyplasin-S encoding cDNA sequences. Amino acid exchanges 
detected by the PCR procedure are indicated in brackets. Asn- 
linked glycosylation sites are found at aa positions N-151, N- 
271, N-401, N-416 and N-422. The putative cleavage point of 
the secretory signal sequence is between aa 52 (S) and aa 53 
(A) . 

(b) Nucleotide sequence of the protein Cypl-Mut- (-Sig. Seq) 

Figure 3: Insect cells (Sf9) transfected with the pIZ vector- 
driven construct expressing cyplasin-L-EGFP 
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The upper panel shows Sf9 cells in bright field and the lower, 
panel shows the identical section in fluorescence mode (515 
nm) . Bar 10 pM. 

Figure 4: Enrichment of the recombinant cyplasin-L-EGFP fusion 
protein in cytotoxic protein fractions released from SF9 cells 
Extracts containing the cytotoxic factor were prepared from 
SF9 cells expressing cyplasin-L-EGFP as described under 
materials and methods. Identical samples were separated on a 
12% polyacrylamide gel. Polypeptides run on parallel gel 
sections together with a protein size marker were either 
visualized by a silver-staining procedure or blotted to a PVDF 
membrane. The membrane was probed with an anti-EGFP antibody 
and immuno-complexes formed were visualized by means of an 
alkaline phosphatase-coupled second antibody. (a) shows the 
protein size marker (b) shows the prominent polypeptides 
present in the extract, (c) shows the antigen detected by the 
EGFP-specific antibody. It should be noted that the anti-EGFP 
antibody detects a polypeptide in the order of 70 kDa which is 
significantly larger than EGFP (27 kDa) . This result indicates 
the enrichment of the EGFP-tagged fusion protein in the 
cytotoxic fraction. The calculated molecular mass of the 
fusion protein between the cyplasin-L precursor protein (57.2 
kDa) and EGFP is 84.2 kDa. The processed cyplasin-L with 
deleted signal sequence has a calculated molecular mass of 
41.6 kDa resulting in a molecular mass of 68.6 kDa when fused 
to EGFP which is close to the size of the fusion protein 
detected on the blot. Accordingly, it has to be assumed that 
the cytotoxic extract contains the EGFP-tagged and processed 
cyplasin-L. 

Figure 5: Cytotoxic effects of genuine and rec ombinant 
cyplasin-L-EGFP 

Four different cell lines were treated for five hours with 
genuine cyplasin and with standard extracts (Example 1) from 
SF9 cells stably expressing cyplasin-L-EGFP. Genuine cyplasin: 
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Primary human skin fibroblasts (HSF) , incubated with 50 nM 
cyplasin. At this concentration HSF cells show a slight but 
typical reaction that implies retraction of the cell membrane 
and partial detachment- Cell death is not observed at this 
concentration. The cells recover and continue to proliferate. 
Primary human melanoma cells derived from biopsies are more 
susceptible to the cytotoxic effect of cyplasin than HSF 
cells. After addition of cyplasin (2 nM) the cells show the 
typical cyplasin-induced membrane changes and finally die. 
Glia cells from a permanent cell line originating from the 
brain cortex of rat embryos are most sensitive when treated 
with cyplasin. Addition of 0.5 nM cyplasin is sufficient to 
induce cell death. Rat kangaroo PtK cells require 2 nM 
cyplasin to exhibit the morphology of dying cells. Recombinant 
cyplasin-L-EGFP: Standard extracts of recombinant cyplasin-L- 
EGFP (100 \xl / 500 ]il medium) show, in parallel cultures , 
essentially identical and graded cytotoxic effects. Bar 10 pM. 

Figure 6: Dose-response curve of cyplasin for various cell 
lines 

Glia cells are the cells most sensitive to cyplasin. Less than 
1 nM cyplasin suffices to kill the majority of them. Primary 
human melanoma cells and PtK cells show also high sensitivity 
to cyplasin, whereas HSFs are much more tolerant; only a dose 
as high as 100 nm cyplasin will kill these cells. 

Figure 7 : Apoptotic cell death induced by staurosporine and 
cell death induced by genuine cyplasin 

PtK cells were treated with 10 nM cyplasin for 5 hours (upper 
panel) , or with 1 ]xg/ml staurosporin for three hours (lower 
panel) . The cells were stained with a mixture of FITC-labeled 
annexin V and propidium iodide as described elsewhere in 
detail [8] . The FITC-Annexin V staining shows the 
characteristic translocation of phosphatidylserine from the 
inner to the outer side of the plasma membrane. No FITC- 
Annexin V staining is found in cyplasin-treated cells that 
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show the characteristic cyplas in-induced morphologxcal 
changes. Neither staurosporine nor cyplasin permeabilize the 
cells, which is revealed by missing propidium iodide staining 
of nuclei. Bar 10 uM. 

Figure 8: Anaphase progress of a PtK cell present in a culture 
treated for one hour with 2 nM genu ine cvplasin 

From upper left to lower right: No interference is observed 
with the process of anaphase, which is terminating in an 
apparently normal cytokinesis. After entering interphase this 
cell showed the typical cyplasin-induced changes in 
morphology. Bar 10 um. 

Figure 9: Effect of cyplasin on th e actin cytoskeleton of 
human primary melanoma cells 

Cvplasin (10 nM) causes a fast depolymerization of actin 
fibers, with the exception of the cortical area where 1-actin 
staining persists (arrows). (a) Intreated control; 8b) 30- 
minute cyplasin incubation; (c) 60-minute cyplasin incubation; 
(d) 90-minute cyplasin incubation; (e) 120-minute cyplasin 
incubation; (f) 150-minute cyplasin incubation. Bar 10 um. 

Figure 10: Prediction of the signal peptide and its cleavage 
site in the N-terminal amino acid sequence of cyplasin 
The highest probability for cleavage was determined to be 
between aa positions 19 and 20 or (with lower probability) 
between aa positions 52 and 53. 

Figure 11: Amino acid sequence of the cvplasin with removed 
secretory leader sequence 

In human cells transfected with cDNA encoding this cyplasin 
variant of the protein (which is still cytotoxic) remains in 
the cytoplasm of the cells. 

Figure 12: Micrograph of HeLa cells transfected with the 
modified cvplasin cDNA 
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Lower panel: bright field; upper panel: identical section in 
fluorescence mode. All cells contain EGFP and, as a 
consequence, cyplasin . 

Thus, the present invention relates to an isolated nucleic 
acid molecule encoding the protein cyplasin with a deleted or 
non- functional secretory signal sequence or a protein 
exhibiting biological properties thereof, being* selected from 
the group consisting of 

(a) a nucleic acid molecule encoding a protein comprising 
the amino acid sequence from position 20 or 53 to 
position 558 of Figure 2 (a); 

(b) a nucleic acid molecule comprising the nucleotide 
sequence of Fig. 2(b) 

(c) a nucleic acid molecule the nucleic acid sequence of 
which deviates from the nucleic sequences specified in 
(a) or (b) due to the degeneration of the genetic code; 
and 

(d) a nucleic acid molecule, which represents a fragment, 
derivative or allelic variation of a nucleic acid 
sequence specified in (a) , (b) or (c) . 

As used herein, a protein exhibiting biological properties of 
cyplasin is understood to be a protein having at least one of 
the biological activities of cyplasin, e.g. cytotoxic 
activity. 

As used herein, the term "isolated nucleic acid molecule'' 
includes nucleic acid molecules substantially free of other 
nucleic acids, proteins, lipids, carbohydrates or other 
materials with which it is naturally associated. For example, 
an isolated nucleic acid molecule could be part of a vector or 
a composition of matter, or could be contained within a cell, 
and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of 
the nucleic acid molecule. 
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The nucleic acid molecules of the invention can be both DNA 
and RNA molecules. Suitable DNA molecules are, for example, 
genomic or cDNA molecules. It is understood that all nucleic 
acid molecules encoding all or a portion of cyplasin are also 
included, as long as they encode a protein with biological 
activity. The nucleic acid molecules of the invention can be 
isolated from natural sources or can be synthesized according 
to known methods. 

The nucleic acid molecules of the present invention also 
include molecules with sequences that are degenerate as a 
result of the genetic code. 



In a further embodiment, the present invention provides 
nucleic acid molecules which comprise fragments, derivatives 
and allelic variants of the nucleic acid molecules described 
above encoding a protein of the invention. "Fragments" are 
understood to be parts of the nucleic acid molecules that are 
long enough to encode one of the described proteins. These 
fragments comprise nucleic acid molecules specifically 
hybridizing to transcripts of the nucleic acid molecules of 
the invention. These nucleic acid molecules can be used, for 
example, as probes or primers in a diagnostic assay and/or kit 
and, preferably, are oligonucleotides having a length of at 
least 15, preferably at least 50 nucleotides. The nucleic acid 
molecules and oligonucleotides of the invention can also be 
used, for example, as primers for a PCR reaction. 

The term "derivative" in this context means that the sequences 
of these molecules differ from the sequences of the nucleic 
acid molecules described above at one or several positions but 
have a high level of identity to these sequences. Identity 
hereby means a sequence identity of at least 40%, in 
particular an identity of at least 60%, preferably of more 
than 80% or 85% and particularly preferred of more than 90%, 
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92%, 95% or 98%. These proteins encoded by the nucleic acid 
molecules have a sequence identity to the claimed amino acid 
sequence depicted in Fig. 2 of at least 60% or 70%, preferably 
of 80% or 85% and particularly preferred of more than 90%, 
95%, 97% and 99%. The deviations to the above-described 
nucleic acid molecules may have been produced by deletion, 
substitution, insertion or recombination. 

The nucleic acid molecules that are homologous to the above- 
described molecules and that represent derivatives of these 
molecules usually are variations of these molecules that 
represent modifications having the same biological function. 
They can be naturally occurring variations, for example 
sequences from other organisms, or mutations that can either 
occur naturally or that have been introduced by specific 
mutagenesis. Furthermore, the variations can be synthetically 
produced sequences. The allelic variants can be either 
naturally occurring variants or synthetically produced 
variants or variants produced by recombinant DNA processes. 

Generally, by means of conventional molecular biological 
processes it is possible (see, e.g., Sambrook et al., 1989, 
Molecular Cloning, A Laboratory Manual, 2 nd edition Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY) to introduce 
different mutations into the nucleic acid molecules of the 
invention. 

For the manipulation in prokaryotic cells by means of genetic 
engineering the nucleic acid molecules of the invention or 
parts of these molecules can be introduced into plasmids 
allowing a mutagenesis or a modification of the sequence by 
recombination of DNA sequences . By means of conventional 
methods (cf. Sambrook et al., 1989, Molecular Cloning: A 
Laboratory Manual, 2 nd edition, Cold Spring Harbor Laboratory 
Press, NY, USA) bases can be exchanged and natural or 
synthetic sequences can be added. In order to link the DNA 
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fragments with each other adapters or linkers can be added to 
the fragments. Furthermore, manipulations can be performed 
that provide suitable cleavage sites or that remove 
superfluous DNA or cleavage sites, preferably removal of the 
secretion signal. If insertions, deletions or substitutions 
are possible, in vitro mutagenesis, primer repair, restriction 
or ligation can be performed. As analysis method usually 
sequence analysis, restriction analysis and other biochemical 
or molecular biological methods are used. 

The proteins encoded by the various variants of the nucleic 
acid molecules of the invention show certain common 
characteristics, such as enzyme activity, molecular weight, 
immunological reactivity or conformation. or physical 
properties like the electrophoretical mobility, 

chromatographic behavior, sedimentation coefficients, 
solubility, spectroscopic properties, stability; pH optimum, 
temperature optimum. 

The invention furthermore relates to vectors containing the 
nucleic acid molecules of the invention. Preferably, they are 
plasmids, cosmids, viruses, bacteriophages and other vectors 
usually used in the field of genetic engineering. Vectors 
suitable for use in the present invention include, but are not 
limited to the T7-based expression vector for expression in 
bacteria, the pMSXND expression vector for expression in 
mammalian cells and baculovirus-derived vectors for expression 
in insect cells. Preferably, the nucleic acid molecule of the 
invention is operatively linked to the regulatory elements in 
the recombinant vector of the invention that guarantee the 
transcription and synthesis of an RNA in prokaryotic and/or 
eukaryotic cells that can be translated. The nucleotide 
sequence to be • transcribed can be operably linked to a 
promoter like a T7, metallothionein I or polyhedrin promoter. . 

In a further embodiment, the present invention relates to 
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recombinant host cells transiently or stably containing the 
nucleic acid molecules or vectors of the invention. A host 
cell is understood to be an organism that is capable to take 
up in vitro recombinant DNA and, if the case may be, to 
synthesize the proteins encoded by the nucleic acid molecules 
of the invention. Preferably, these cells are prokaryotic or 
eukaryotic cells, for example mammalian cells, bacterial 
cells, insect cells or yeast cells. The host cells of the 
invention are preferably characterized by the fact that the 
introduced nucleic acid molecule of the invention either is 
heterologous with regard to the transformed cell, i.e. that it 
does not naturally occur in these cells, or is localized at a 
place in the genome different from that of the corresponding 
naturally occurring sequence. 

A further embodiment of the ' invention relates to isolated 
proteins exhibiting biological properties of cyplasin, 
preferably cyplasin wherein the normally occurring secretion 
sequence has been removed or is non-functional, and being 
encoded by the nucleic acid molecules of the invention, as 
well as to methods for their production, whereby, e.g, a host 
cell of the invention is cultivated under conditions allowing 
the synthesis of the protein and the protein is subsequently 
isolated from the cultivated cells and/or the culture medium. 
Isolation and purification of the recombinantly produced 
proteins may be carried out by conventional means including 
preparative chromatography and affinity and immunological 
separations involving affinity chromatography with monoclonal 
or polyclonal antibodies. As used herein, the term "isolated 
protein" includes proteins substantially free of other 
proteins, nucleic acids, lipids, carbohydrates or other 
materials with which it is naturally associated. Such proteins 
however not only comprise recombinantly produced proteins but 
include isolated naturally occurring proteins, synthetically 
produced proteins, or proteins produced by a combination of 
these methods. Means for preparing such proteins are well 
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understood in the art. The proteins of the invention are 
preferably in a substantially purified form. 

In addition to cyplasin, there are a variety of proteins wich 
are cytotoxic for tumor cells and, thus, might be of 
therapeutic value. For obtaining such proteins in sufficient 
quantity/quality they have to be • recombinant ly produced. 
Preferably, said proteins should be produced in mammalian 
cells, preferably in human cells, in order to ensure that the 
secondary modifications required for cytotoxic acitivity like 
glycosylation are present. However, in cases where the protein 
is secreted from the host cells and cytotoxic only if reacting 
with the outside of the cell membran, its recombinant 
production can not be achieved since the secreted protein 
kills its host cells. It has been found by the present 
inventors that this problem can be overcome, i.e. that such a 
cytotoxic protein can be produced in mammalian cells if its 
export from the host cells is blocked after synthesis. This 
can be accomplished by expressing a gene encoding a protein 
without secretory leader sequence. Such a modified protein 
will remain in its host cell and, thus, is no longer cytotoxic 
for the cell. After lysis/homogenisation of the cells, the 
protein will be released and can be isolated and purified as a 
cytotoxic compound by established biochemical methods (see 
Example 11, below, relating to the recombinant production of 
cyplasin with removed secretory leader sequence in HeLa 
cells) . 

Thus, the present invention also relates to a general method 
of making a protein in eukaryotic host cells, preferably 
mammalian cells (e.g. HeLa cells), which is cytotoxic for said 
cells when externally applied, comprising: 

(a) culturing a host cell transfected with a nucleic acid 
sequence encoding said protein with a deleted or non- 
functional secretory signal sequence under conditions such 
that said protein is expressed; and 
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(b) recovering said protein from the cells. 



Methods for the transfection of cells, recombinant production 
of proteins and recovery of the proteins from the cells have 
already been described above. The person skilled in the art 
can generate a nucleic acid sequence encoding a protein which 
is no longer secreted from the host cells by well known 
methods, e.g. by in vitro mutagenesis. Methods for determining 
the location/position of a secretory signal sequence are also 
well known and, e.g., described in [19-22]. 

Finally, the present invention also relates to a 
pharmaceutical composition comprising <i) a nucleic acid 
molecule, preferably a nucleic acid molecule encoding the 
protein with or without the secretory leader sequence, or (ii) 
protein of the invention in combination with a 
pharmaceutically acceptable recipient, diluent or carrier as 
well as the use of said compounds for preparing a 
pharmaceutical composition for treating cancer. 

Examples of suitable pharmaceutical carriers etc. are well 
known in the art and include phosphate buffered saline 
solutions, water, emulsions, such as oil/water emulsions, 
various types of wetting agents, sterile solutions etc. Such 
carriers can be formulated by conventional methods and can be 
administered to the subject at a suitable dose. Administration 
of the suitable compositions may be effected by different 
ways, e.g. by intravenous, intraperitoneal, subcutaneous, 
intramuscular, topical or intradermal administration. The 
route of administration, of course, depends on the nature of 
the tumor, its localisation and the kind of compound contained 
in the pharmaceutical composition. The dosage regimen will be 
determined by the attending physician and other clinical 
factors. As is well known in the medical arts, dosages for any 
one patient depends on many factors, including the patient's 
size, body surface area, age, sex, the particular compound to 
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be administered, time and route of administration, the kind 
and stage of the tumor, general health and other drugs being 
administered concurrently. 

The delivery of the nucleic acid molecules of the invention 
can be achieved by direct application or, preferably, by using 
a recombinant expression vector such as a chimeric virus 
containing these compounds or a colloidal dispersion system. 
Direct application to the target site can be performed, e.g., 
by ballistic delivery, as a colloidal dispersion system or by 
catheter to a site in artery. The colloidal dispersion systems 
which can be used for delivery of the above nucleic acid 
molecules include macromolecule complexes, nanocapsules, 
microspheres, magnetospheres, beads and lipid-based systems 
including oil-in-water emulsions (mixed), micelles, liposomes 
and lipoplexes, The preferred colloidal system is a liposome. 
The composition of the liposome is usually a combination of 
phospholipids and steroids, especially cholesterol. The 
skilled person is in a position to select such liposomes which 
are suitable for the delivery of the desired nucleic acid 
molecule. Organ-specific or cell-specific liposomes can be 
used in order to achieve delivery, only to the desired tumor. 
The targeting of liposomes can be carried out by the person 
skilled in the art by applying commonly known methods. This 
targeting includes passive targeting (utilizing the natural 
tendency of the liposomes to distribute to cells of the RES in 
organs which contain sinusoidal capillaries) or active 
targeting (for example by coupling the liposome to a specific 
ligand, e.g., an antibody, a receptor, sugar, glycolipid, 
protein etc., by well known methods). In the present invention 
monoclonal antibodies are preferably used to target liposomes 
to specific tumors via specific cell-surface ligands. 

Preferred recombinant vectors useful for gene therapy are 
viral vectors, e.g. adenovirus, herpes virus, vaccinia, or, 
more preferably, an RNA virus such as a Retrovirus". Even more 
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preferably, the retroviral vector is a derivative of a murine 
or avian retrovirus. Examples of such retroviral vectors which 
can be used in the present invention are: Moloney murine 
leukemia virus (MoMuLV) , Harvey murine sarcoma virus (HaMuSV) , 
murine mammary tumor virus (MuMTV) and Rous sarcoma virus 
(RSV) . Most preferably , a non-human primate retroviral vector 
is employed, such as the gibbon ape leukemia virus (GaLV) r 
providing a broader host range compared to murine vectors. 
Since recombinant retroviruses are defective, assistance is 
required in order to produce infectious particles. Such 
assistance can be provided, e.g., by using helper cell lines 
that contain plasmids encoding all of the structural genes of 
the retrovirus under the control of regulatory sequences 
within the LTR. Suitable helper cell lines are well known to 
those skilled in the art. Said vectors can additionally 
contain a gene encoding a selectable marker so that the 
transduced cells can be identified. Moreover, the retroviral 
vectors can be modified in such a way that they become target 
specific. This can be achieved, e.g., by inserting a 
polynucleotide encoding a sugar, a glycolipid, or a protein, 
preferably an antibody. Those skilled in the art know 
additional methods for generating target specific vectors. 
Further suitable vectors and methods for in vitro- or in vivo- 
gene therapy are described in the literature and are known to 
the persons skilled in the art; see, e.g., WO 94/29469 or WO 
97/00957. 

In order to achieve expression only in the target organ, e.g., 
a particular tumor to be treated, the nucleic acid molecules 
of the present invention can be linked to a tissue specific 
promoter and used for gene therapy. Such promoters are well 
known to those skilled in the art (see e.g. Zimmermann et al., 
(1994) Neuron 12, 11-24; Vidal et al.; (1990) EMBO J. 9, 833- 
840; Mayford et al., (1995), Cell 81, 891-904; Pinkert et al., 
(1987) Genes & Dev. 1, 268-76). 
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The following Examples illustrate the invention. 

In summary, the results of the Examples support previous 
suggestions pointing to cytotoxic substances of high molecular 
weight that are produced and secreted by Aplysia species [5, 
6] . Protein fractions from the secreted mucus of A. punctata 
show cytotoxic and finally killing activity when added to 
cells that grow independently of proliferation-controlling 
activities, e.g. in culture. One of these factors has been 
characterized on the peptide sequence level and it has been 
termed cyplasin. Interestingly, cyplasin shows a graded 
cytoxicity on cells in culture. It is highly cytotoxic to 
established cell lines, as shown for the glia cell line and 
PtK cells, as well as to many primary tumor cells, such as the 
human melanoma tested. Human skin fibroblasts show a 
significantly higher tolerance. Since other tumor cells tested 
are also highly sensitive (not shown) it appears that cyplasin 
is especially cytotoxic to established cell lines and to 
primary tumor cells. The different response of primary human 
fibroblasts is probably due to the fact that these cells 
cannot be considered as tumor cells although growing 
autonomously [17] . Accordingly, cyplasin might be useful for 
the specific elimination of non-desired cells in an organism, 
such as tumor cells. 

Such a view is supported by preliminary in vivo experiments. 
In no case a toxic effect of the injected cyplasin was found 
when injected in normal mice, even when high concentrations of 
cyplasin were used. 

The natural source for cyplasin is limited.? hence, its 
recombinant production appears to be a prerequisite for its 
potential application as an anti-cancer drug. In a first step 
we searched for a cDNA, which could be considered to encode 
the protein with an apparent molecular mass of 56 kDa, which 
had been isolated by the bioassay-guided fractionation 



-17- 




procedure. Using a subsequence of this protein as probe and 
conventional PCR and cDNA cloning techniques we found that 
more than one A. punctata transcript comprises the subsequence 
used as specific probe. Two cDNAs encoding polypeptides with 
diverging carboxy termini could be identified on the sequence 
level. Moreover, individual cDNA clones showed slightly 
diverging nucleotide sequences when PCR products were cloned 
which were prepared on the basis of complete A. punctata cDNA 
library template and primer pairs fitting the coding regions 
of the cDNAs identified in the first step. Actually , all 
individual clones investigated so far showed slightly 
different nucleotide sequences with the consequence of one or 
more amino acid exchanges in the corresponding polypeptide. It 
is highly unlikely that all these transcripts originate from 
different genes in A. punctata. Posttranslational processes 
like alternative splicing, differential polyadenylation and 
RNA editing could result in transcripts encoding the different 
polypeptides . 

At this stage it is unknown whether the different polypeptides 
identified at the transcript level exhibit all identical 
functions. In this situation it appeared worthwhile to select 
only one cDNA species (encoding the protein termed cyplasin-L) 
and to investigate whether this sequence could encode a 
cytotoxic protein. The recombinant polypeptide produced in E. 
coll was found to be biologically inactive. However, 
eukaryotic cells transfected with constructs expressing this 
selected cDNA or this cDNA in fusion with the EGFP-encoding 
nucleotide sequence produced a cytotoxic factor that was not 
present in non-transf ected cells nor in cyplasin-S transfected 
cells. Insect cells (Sf9) transfected with pIZ-driven 
expression constructs became especially useful. In this case 
stably transfected cell lines could be established which 
permitted the preparation of biologically active EGFP-tagged 
cyplasin-L in quantities sufficient to compare the biological 
activity of the recombinant protein with the material that can 
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be biochemically isolated from the secreted mucus of a. 
punctata. The very similar morphological effects achieved by 
the biochemical isolate and by the recombinantly expressed 
protein suggesed that the selected cDNA is a valid clone and 
that it encodes a protein presenting the cytotoxic principle 
of the genuine cyplasin of A. punctata. Finally, it could be 
shown that biologically active cyplasin could be recombinantly 
produced in HeLa cells when using a DNA sequence encoding the 
protein without the secretory signal sequence. With the 
availability of bioactive recombinant cyplasin, it is now 
possible to evaluate its potential anti-tumor therapeutic 
value . 



Example 1: Materials and Methods 

(A) Biochemical isolation of cyplasin 

Mucus of albumen glands of the sea hare A. punctata can be 
obtained from animals during the spawning season when they 
come to the shore (around April on lie d'Yeu) . By gently 
squeezing the animal, the mucus (approximately 2.5 ml) is 
excreted as purple fluid forming a gel when exposed to air. It 
is immediately diluted (1:1, vol/vol) with phosphate-buffered 
saline (PBS, 150 mM NaCl, 10 mM NaH 2 P0 4 , pH 7.2) and placed at 
4°C. After 2-3 hours the mixture becomes completely soluble. 
This step is followed by centrifugation at 10.000 x g, 15 min, 
4°C, to remove debris. The supernatant can be frozen and kept 
at -80°C without loss of activity. For further purification 
the mucus is dialysed against 1000 volumes of 50 mM MOPS, 1 mM 
dithioerythritol, 0 . 5 mM EDTA, 5 mM KC1, pH 7.2 for 24 hours 
at 4°C. Protein fractions containing the cytotoxic activity 
were isolated by fractionated precipitation with ammonium 
sulphate. Cytotoxic activity was detected in precipitates 
collected between 33% / 50% (pellet 1) and 50 % / 66% (pellet 
2) saturation, respectively. Most of the cytotoxic activity 
was usually found in pellet 1. For cytotoxicity tests pellets 
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were dissolved in 300 pi PBS, dialysed against the buffer 
described above- The most active fractions comprised 
protein (s) migrating as an essentially single band on a SDS- 
PAGE gel (Fig. 1) . 

(B) Identification of the SGDYILIASYAD peptide in the fraction 
of cytotoxic protein (s) 

Material used for the microsequencing procedure was further 
purified by gel filtration (G-200-column, Sigma-Aldrich, 
TaufJcirchen, Germany) in a buffer comprising 50 mM MOPS, 1 mM 
dithioerythritol, 0.5 mM EDTA, 5 mM KC1, at pH 7.2. The 
dialyzed and lyophilized efflux was submitted to SDS-PAGE and 
blotted to a PVDF membrane (ProtoBlot, Applied Biosys terns) . 
Sections containing the region of interest were analysed by 
microsequencing procedures performed by WITA GmbH ( Berlin , 
Germany) . 

(C) Cytotoxicity test 

Aliquots from each pellet, dissolved in 300 \xl PBS, were 
tested for their toxic effect on autonomously growing cells. 
The term * autonomously growing cells' is used for all cells 
capable of proliferating in vitro, in contrast to cells 
proliferating within an organism. Routine tests were performed 
using the rat kangaroo cell line PtK 2 and the human cell line 
HeLa. Cells were seeded in 24-well plates containing 500 pi 
medium per well, using cell densities resulting in about 50% 
confluency after 24 hours. At this time undiluted aliquots of 
the redissolved pellet (s) (5 |il) were added and cell cultures 
in parallel wells were supplemented with aliquots (5 jil) of 
serial dilutions. 

(D) Characterization of cell death induced by genuine cyplasin 
Morphological alterations of cells undergoing cyplas in-induced 
death were recorded by light microscopy. In addition 
permeability changes of the plasma membranes were investigated 
by incubating the cyplasin-treated cells with the non-membrane 



-20- 



permeant compound H33257 (SIGMA-ALDRICH, Taufkirchen, 
Germany), 0.5ug/ml, or propidium iodide (Boehringer 
Ingelheim, Germany), 1 ug/ml. Staining of nuclei was 
considered as indication for pathological permeability changes 
associated with necrosis or the final stages of apoptosis. To 
differentiate the apoptotic form of death, cyplasin-treated 
cells were incubated in 5 ug/ml of FITC-labeled Annexin V 
(Boehringer Ingelheim, Germany) for 20 min in Ca 2+ -containing 
buffer and the presence of a potential phosphatidyl serine- 
annexin complex was evaluated by fluorescence microscopy using 
appropriate filters (8). For control, apoptosis was induced in 
cells by incubation with 0.2 ug/ml staurosporine for three 
hours. This treatment induced a clear translocation of 
phosphatidylserine to the outer face of the plasma membrane, 
thus becoming accessible to the FITC-Annexin [9]; the 
concentration of staurosporine, however, was sufficiently low 
to prevent the parallel staining of cell nuclei with propidium 
iodide . 

(E) A. punctata cDNA 

Total RNA was isolated from albumen glands of the sea hare A. 
punctata by means of the Qiagen RNA isolation kit. The 
Clontech SMART II PCR cDNA synthesis kit (K1052-1) was used to 
convert 100 ng amounts of total RNA into cDNA. First strand 
synthesis was primed with the modified oligo-dT included in 
the kit and primer extension was performed with the 
recommended RNase H" point mutant reverse transcriptase 
(Superscript II, Gibco BRL) . The SMART II oligo inducing the 
template switch at 5* ends was included in the first-strand 
reaction. These reactions and PCR amplifications of first- 
strand cDNA by means of the modified oligo (dT) and SMART II 
primers were performed according to the instructions of the 
producer of the kit. 

(F) Molecular cloning of cDNAs encoding proteins comprising 
the peptide SGDYILIAS YAD 
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Amplified cDNA was used as a template and PCR reactions were 
primed with combinations of specific primers corresponding to 
the search sequence and with non-specific primers , e.g. 
modified oligo-dT and Smart II, respectively. Amplification 
products were recloned in a pBluescript-derived T-overhang 
vector and sequenced. The validity of these sequences was 
verified by PCR reactions primed with oligo deoxynucleotides 
corresponding to sequences upstream and downstream of the 
specific SGDYILIASYAD-encoding primer. These probe- independent 
products contained the nucleotide sequence* encoding the 
peptide SGDYILI AS YAD . Sequences found upstream of 
SGDYILIASYAD-encoding sequence were unique, except for several 
base exchanges discussed in the text. In contrast, two 3' end 
sequences could be detected differing in length (L and S) . 

(G) Fusion and expression constructs 

The protein-coding sections were PCR amplified with primers 
placing suitable restriction sites to the 5' and 3' ends of 
the amplification products. Following digestion with the 
corresponding restrictases the products were either directly 
cloned into the expression vectors pcDNA3 (Invitrogen, for 
expression in mammalian cells), pQE30 (Qiagen, for expression 
in E. coli) f pIZ/V5-His (Invitrogen, for expression in insect 
cells) or fused with the EGFP-encoding cDNA (Clontech) 
prepared in the Xhol / NotI sites of the pBluescript vector. 
Excision of the EGFP-tagged fragments and recloning in 
appropriate sites of the pcDNA3 vector or the pIZ/V5-His 
vector resulted in the corresponding cyplasin-EGFP expression 
constructs suitable for expression of f luorescently labeled 
fusion proteins in mammalian and insect cells, respectively. 

(H) Trans feet ions and recombinant protein expression 

JE. coli M15 cells were transformed with the pQE30 plasmids 
containing the cyplasin-L and cyplasin-S-encoding inserts in 
frame with the His-tag of the vector. The expressed His-tagged 
proteins were isolated by means of Ni-NTA agarose according to 
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the protocol supplied by Qiagen. HeLa cells were transfected 
with the pcDNA3 plasmids containing either EGFP-tagged or non- 
tagged cyplasin-L and cyplasin-S-encoding inserts by means of 
the Ef fectene transfection kit (Qiagen) . Cells transfected 
with constructs containing the insert encoding cyplasin-L or 
cyplasin-L-EGFP could not survive longer periods. However, 
supernatants of such cultures contained the cytotoxic factor 
described in the text. SF9 cells were transfected with the 
pIZ/V5-His plasmids containing either EGFP-tagged or non- 
tagged cyplasin-L encoding inserts using in addition the 
Effectene transfection kit (Qiagen) . In contrast to mammalian 
cells transfected insect cells survived. Expression was 
followed either by fluorescence microscopy of living cells or 
by testing of cytosolic extracts for the presence of a 
cytotoxic factor. 

(I) Stably-transfected SF9 cells for large scale production of 
cyplasin-L-EGFP 

SF9 cells transfected with the plasmid pIZ/V5-His-cyplasin-L- 
EGFP were grown for three months as semi-attached cells at 
26°C in TNM-FH insect medium (Applichem, Darmstadt, Germany) 
supplemented with 10% fetal calf serum, 5 mM Glutamax (LIFE 
Technologies, Karlsruhe, Germany) and 100 ug/ml zeocin 
(Invitrogen, Groningen, The Netherlands) . The cell cultures 
were diluted 1:3 at four-day intervals. The original 
transfection efficiency was approximately 10%, after a three- 
months' period 5% of the cells remained fluorescent. The 
latter fraction was considered to be stably transfected. Cells 
of this fraction were separated by means of a fluorescence- 
activated cell sorter (Beckton-Dickinson) . Following a second 
sorting performed after four weeks the resulting culture could 
be grown in spinner cultures up to several litres and more 
than 90% of these cells expressed cyplasin-L-EGFP fusion 
protein. 

(J) Recovery of the cytotoxic factor from SF9 cells stably 
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expressing cyplasin-L-EGFP 

The EGFP-tagged cyplasin-L is not secreted into the medium of 
SF9. Routinely, 1-2 x 10 8 stably transfected Sf9 cells were 
washed by suspension and centrif ugation (1000 x g, 3 min) , 
once in PBS, and once in 50 mM MES, 1 mM EDTA, 5 mM KC1, 0.1% 
Mercaptoethanol, pH 6.0. They were homogenized in 5 ml of the 
latter buffer. Homogenization and all subsequent steps were 
performed at 4°C. A protease inhibitor cocktail (Roche 
Diagnostics, Mannheim, Germany) was present throughout the 
purification procedure. The homogenate was centrif uged 
(100.000 x g, 60 min), and the supernatant was applied to a 
DEAE-Cellulose column (DE52, Sigma Aldrich, Taufkirchen, 
Germany) that had been equilibrated with the buffer described 
above. The column was washed extensively with the buffer used 
for equilibration followed by application of a NaCl-gradient 
(0-200 mM) . Eluted fractions were tested for the presence of 
the cytotoxic factor by addition of 100 jal of each fraction to 
indicator cells (PtK) growing in 500 pi culture medium. If 
present, cytotoxic effects were observed after about five 
hours. Factor-containing fractions were eluted between 60 and 
80 mM NaCl. Fractions with these characteristics were 
considered as ^standard' extracts, and used for other 
biological tests, e.g. those described in Fig. 5. 

(K) Identification of cyplasin-L-EGFP in cytotoxic extracts 
isolated from stably transfected SF9 cells 

Protein fractions isolated as described above and exhibiting 
cytotoxic activity were concentrated and separated by 12.5% 
SDS-PAGE. Two identical samples (including a protein standard) 
were separated on the same gel. One section of the gel was 
stained using a silver-staining procedure; the other section 
was electroblotted (semi-dry blotting apparatus , Biometra, 
Gottingen, Germany) to a PVDF transfer membrane (Westran, 
Schleicher & Schuell, Dassel, Germany) . Buffer composition was 
3.03 g boric acid, 200 ml methanol, 800 ml H 2 0, pH 9.0. 
Following blocking with BLOTTO [10] the membrane was incubated 
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for 3 hours (26°C) with anti-GFP antibody (ABCAM, Cambridge, 
U.K.) diluted 1:2000 in PBS, pH 7.2, containing 0.1% BSA. 
After prolonged rinsing in PBS immunodetection was performed 
by means of an alkaline phosphatase-coupled goat-anti-rabbit 
antibody (Dianova, Hamburg, Germany) which was applied for 
three hours at 26°C, diluted 1:12000 PBS, pH 7.2, containing 
0.1% BSA. The blot was rinsed in PBS and placed into the 
staining solution consisting of 100 mM TRIS, 5 mM MgCl 2 , 0.3 mg 
/ ml nitro blue tetrazolium, 0.15 mg / ml 5-bromo-4-chloro-3- 
indolylphospate, pH 9.5. 



(L) Animal experiments 

DBA2 mice were injected with 300 ul (10 uM) genuine cyplasin, 
either in the tail vein (group 1) or subcutaneous ly (group 2). 
Cyplasin had been dialysed before against a large volume of 
PBS for 24 h at 4° C and tested for positive cytoxicity 
immediately before injection by incubating PtK cells with 10 
nM cyplasin. Recombinant cyplasin was also dialysed against 
PBS, tested for positive cytotoxicity before injection, and 
300 pi were injected into the tail vein. Mice were maintained 
under standard conditions and observed for four weeks. 

(M) Stably-transfected HeLa cells for large scale production 
of cyplasin-L-EGFP without secretory signal sequence 
Following digestion with the corresponding restrictases the 
cyplasin L- (-Sig. Seq) -encoding cDNA was fused with the EGFP- 
encoding cDNA (Clontech, Heidelberg, Germany) prepared in the 
Xhol/Notl sites of the pBluescript vector. Excision of the 
EGFP-tagged fragments and recloning in the appropriate sites 
of the pcDNA3 vector (Invitrogen) resulted in the 
corresponding Cyplasin-L (-Sig. Seq. ) -EGFP expression of the 
fluorescently labelled fusion proteins in mammalian cells. 
Selection of successfully transfected cells was achieved using 
G-418-sulfate resistance. 



(N) Other methods 
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Database searches and sequence analyses were performed by 
means of the HUSAR program package (DKFZ) that is a collection 
of sequence analysis tools based on the GCG program package 
developed by GCG Inc. (Madison, WI., USA) . For the 
identification of the secretory signal sequence we applied the 
McGeoch scan program [11] . DNA sequencing was performed by A. 
Hunziker (German Cancer Research Center) by means of an 
automatic DNA-sequencer, model 373A (Applied Biosystems) . 



Example 2: Molecular cloning of cyplas in-encoding cDNAs 

cDNA prepared from RNA of the albumen gland of A. punctata 
comprises more than one transcript encoding the peptide 
SGDYILIASYAD . Two cDNAs were cloned encoding proteins which 
diverge significantly in their carboxy-terminal sections but 
which comprise the target sequence (Fig. 2) . One of these 
cDNAs encodes a protein of 558 aa residues with a molecular 
mass of 62.4 kDa (termed- cyplasin-L) while another cDNA 
reflects a transcript encoding a shorter protein (421 aa 
residues, molecular mass 46.9 kDa, termed cyplasin-S). 
Moreover, PGR on total cDNA with cyplasin-L specific primer 
pairs results in DNA fragments whose sequences diverge from 
those encoding cyplasin-L and cyplasin-S, respectively. 
Accordingly, mRNAs appear to exist which are neither identical 
with cyplasin-L nor with cyplasin-S. These sequence micro 
heterogeneities suggest that A. punctata produces an unknown 
number of very similar but not 100% identical proteins that 
comprise the target sequence. On the basis of the available 
data it cannot be decided whether these different mRNAs and 
proteins derive from one single gene, e. g. by alternative 
splicing in combination with RNA editing, or whether there 
exists a cluster of very similar but not 100% identical genes. 
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Exan$>le 3: Sequence characteristics of the proteins cyplasin-L 
and cyplasin-S encoded by the cloned cDNAs 

Biochemical data suggest (not shown) that the naturally 
occurring cyplasin is a glycoprotein. The cyplasin-L cDNA- 
derived amino acid sequence comprises five Asn-linked (N-X-S 
or N-X-T) glycosylation sites at positions N-151, N-271, N- 
401, N-416 and N-422 that is in agreement with the biochemical 
data. The glycosylation sites 1-4 are unchanged in the 
polypeptide derived from the cyplasin-S cDNA while the 
position N-422 is missing in the shorter sequence. 

The N-termini start with a hydrophobic secretory signal 
sequence of high probability and a predicted cleavage site 
between aa residues 52 (Ser) and 53 (Ala) . Accordingly, the 
molecular masses of the mature and expectedly functional 
proteins amount to 57.2 kDa and 41.6 kDa, respectively. The 
calculated isoelectric points of these mature proteins are 
5.54 (charge -13) for cyplasin-L and 6.20 (charge -5) for 
cyplasin-S. 

Database searches with the nucleotide sequence released 
similarities with two other Aplysia sequences, namely A. 
kurodai albumen gland mRNA for aplysianin-A precursor ([12J, 
70.9 % identities, D83255) , and A. fulica Ferussac mRNA for 
achacin ([7], 52.2 % identities, X64584) . Database searches 
with cyplasin sub-sequences released the amino acid sequences 
of the Aplysia species mentioned above and a number of protein 
sequences with longer strings of local identities or 
homologies. The latter sequences all belong to the class of 
monoamine oxidases. Table 1 shows alignments of one prominent 
cyplasin peptide string with subsequences of eukaryotic and 
prokaryotic monoamine oxidases . It is of interest to note that 
database searches with this and other cyplasin-typical strings 
released no significant hits with proteins from other classes. 
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Table 1: Database searches with the pCyplas in-derived amino 
acid sequence resulted in a number of hits with sequences 
reflecting monoamine oxidases 
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Cyplasin 

151346 

OXLA_CROAD 

AOFA_BOVIN 

AOFBJRAT 

AO F A__HUMAN 
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T22714 

AOFHJ4YCTU 
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A. punctata 

Rainbow trout 

Crotalus 

Bovin 

Rat 

Human 

Human 

Prokaryotic 
Prokaryotic 
Prokaryotic 
Prokaryotic 
Prokaryotic 
Prokaryotic 
Prokaryotic 
Prokaryotic 
Prokaryotic 



Example 4: Expression of biologically inactive recombinants in 

E . coli 

Recombinant expression of cyplasin-encoding cDNA sequences in 
the pQE / E. coli M15 system results in polypeptides which are 
completely insoluble in buffers containing no detergents, and 
suspensions of such recombinantly expressed polypeptides could 
not exert any cytotoxic effect when incubated together with 
cultured cells (not shown) . This missing cytotoxic activity is 
suggestively due to incorrect folding and / or the absence of 
posttranslational modifications of the polypeptides expressed 
in the E. coli system. 
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Example 5: Generation of bioactive recombinants in mammalian 

cells 



In contrast, mammalian cells, e.g. HeLa S3 suspension cells, 
produce a cytotoxic factor when transfected with CMV vector- 
driven expression constructs specifying either cyplasin-L or 
EGFP-tagged cyplasin-L. This factor is not detectable in 
cultures of non-transf ected cells nor in cultures transfected 
with constructs expressing the cyplasin-S version. The 
production of the cytotoxic factor is obvious because all 
cells of factor-producing cultures finally die in the typical 
manner that is observed when mammalian cells are treated with 
genuine cyplasin isolated from the mucus of A. punctata. Since 
only a fraction of cells in such cultures is transfected it 
follows that the cytotoxic factor must be released from the 
producer cells with the consequence of cell death of producer 
and non-producer cells. The release of the cytotoxic factor is 
well in agreement with the predicted secretory signal at the 
amino terminus of the cDNA-derived amino acid sequence (Fig. 
2) . 

Although this self-destructing system is not suitable to 
produce significant amounts of biologically active 
recombinants it reveals the validity of the cDNA cloning 
approach and it indicates that the factor encoded by the cDNA 
with the longer insert shows the cyplas in-typical 
characteristics . 

Example 6: Recombinant expression of bioactive cyplasin-L and 

cyplasin-Ii-EGFP in insect cells 

Insect cells (e.g. Sf9) are known to be able to perform 
posttranslational modifications similar to mammalian cells. 
Since Sf9 cells proved to be much less sensitive to genuine 
cyplasin preparations (not shown) they are especially suited 
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to generate recombinant cyplasin in sufficient amounts for 
biological tests. Transfection of SF9 cells with pIZ vector- 
driven constructs specifying the expression of cyplasin-L or 
of EGFP-tagged cyplasin-L could not influence the 
proliferation rate of SF9 cells. Moreover, the spent medium of 
SF9 cells transfected with the construct specifying cyplasin-L 
contained significant cytotoxic activity for mammalian cell 
cultures, which shows that the secretory signal of cyplasin-L 
is also functioning in insect cells. 

In contrast, no cytotoxic factor was released from SF9 cells 
transfected with the construct specifying EGFP-tagged 
cyplasin-L. The cyplasin-L-EGFP fusion protein is clearly 
expressed in SF9 cells, as shown by EGFP-dependent 
fluorescence (Fig. 3), but no significant amounts of the 
cytotoxic factor can be detected in the spent medium of 
spinner cultures. Interestingly, the Western blot shown in 
Fig. 4 points to the deletion of the signal sequence in the 
cyplasin-L section of the fusion protein. This cleavage must 
occur in such a way that the truncated fusion protein remains 
cytosolic. Alternatively, retrograde translocation from the ER 
to the cytosol has to be assumed. Such retrograde 
translocations have been observed in other systems before [13, 
14, 15, 16] . 

However, the cytotoxic activity of the recombinant ly expressed 
truncated cyplasin-L is maintained when fused to EGFP. The 
high-speed supernatant of homogenized cyplasin-L-EGFP- 
expressing SF9 cells was found to contain the factor that is 
cytotoxic to cultured mammalian cells. Consequently, stably 
transfected cyplasin-L-EGFP expressing Sf9 cell lines were 
generated by fluorescence activated cell sorting (FACS) , and 
fractions of the high-speed supernatants of such cultures 
contained the cyplasin-L-EGFP fusion protein (Fig. 4) and 
exhibited the biological activities shown in Fig. 5. 
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Example 7 : Characteristic features of cyplasin-dependent 

cytotoxicity 

Proliferating mammalian cells exhibit characteristic time and 
concentration-dependent morphological changes when treated 
with, the biochemically isolated genuine cyplasin from the 
mucus of A. punctata (Fig. 5) . The cytotoxic effects of the 
genuine cyplasin become visible, e.g. in PtK cells, in less 
than one hour at 50 nM. For this cell line the minimum 
cytotoxic cyplasin concentration is in the order of 2 nM, 
however, at this concentration the cytotoxic effects appear 
foremost after 24 hours. Once induced, the cyplasin effect is 
irreversible and cell death is observed even if cyplasin- 
containing medium is replaced by fresh medium. Other cultured 
mammalian cells show somewhat lower (human skin fibroblasts) 
or even higher sensitivity (human melanoma cells, glia cells) 
(Fig. 5). 

The morphology of cyplasin-induced cell death is specific. The 
cells detach from the substratum, they shrink and disjoin from 
each other if grown as monolayer or in clusters, and 
occasionally they exhibit numerous small plasma vacuoles. 
Morphological changes of this type can also be observed in 
cells undergoing apoptotic cell death, however, typical 
indicators for apoptosis including nuclear fragmentation and 
exposure of phosphatidyl serine on the outer membrane are 
missing (Fig. 6) . 

Moreover, cyplasin exerts its cytotoxic effects only on cells 
in interphase. Mitotic cells are still able to complete 
anaphase and cytokinesis at a time when most interphase cells 
in the same culture already show the cyplasin-typical change 
in morphology (Fig. 7). However, following, completion of 
mitosis these cells also die when reentering the interphase. 
Neither cell permeability nor the microtubular cytoskeleton 
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nor intracellular Ca 2+ -levels are affected by cyplasin (not 
shown) . 



Example 8: Evaluation of the bioactive recombinant cyplasin-L- 

E6FP 



A thorough side-by-side comparison of the biochemically 
isolated genuine cyplasin and the recombinant cyplasin-L-EGFP 
version meets the problem that the recombinant is, at present, 
only available on the level of enriched extracts . Although an 
exact quantitation is missing so far it is evident that the 
cyplasin-L-EGFP extracted from stably transfected SF9 cells 
exhibits cytotoxic activity which is very similar to that 
induced by the biochemically isolated genuine cyplasin. Fig. 5 
presents side by side the effects of genuine cyplasin and 
recombinant cyplasin-L-EGFP on four different cell lines with 
established different sensitivities to genuine cyplasin. Using 
constant amounts of extracts from cyplasin-L-EGFP expressing 
SF9 cells it is obvious that human skin fibroblasts (HSF) are 
relatively insensitive to recombinant cyplasin-L-EGFP which 
holds true also for the biochemically isolated genuine 
cyplasin. These cells only show a slight initial retraction 
and a weak tendency to shrink when treated either with genuine 
cyplasin (50 nM) or with the standard extracts containing the 
cyplasin-L-EGFP. Finally, they recover and continue to 
proliferate. Death of HSF cells is only observed at cyplasin 
concentrations in the order of 100 nM (not shown) . In 
contrast, cells derived from a biopsy of a human melanoma 
exhibit significantly higher sensitivity when incubated with 
genuine cyplasin (20 nM) f and with the standard extract. 
Melanoma cells treated either with the genuine cyplasin or 
with the recombinant cyplasin-L-EGFP show the typical 
cyplas in-induced retractions, the formation of vacuoles, and 
finally cell death. Other panels of this figure show glia 
cells from an established cell line derived from rat embryo 
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cortices. These cells exhibit the highest cyplasin sensitivity 
of all cells studied so far. The typical cyplasin effect is 
observed at a concentration that is as low as 0.2 nM, and 
complete cell death is observed within a five hours 
observation period. The cells of the rat kangaroo line PtK are 
irreversibly damaged within 5 hours by incubation with 2 nM 
genuine cyplasin after five hours. A similar effect is 
observed after treatment with the standard extract. Prominent 
plasma vacuolisation and membrane changes are induced in these 
cells by genuine cyplasin as well as by recombinant cyplasin- 
L-EGFP. 

Summarizing, these results show that the molecular cloning 
approach released a cDNA encoding a factor exhibiting 
cytotoxic activity similar to that detected in the secreted 
mucus of A. punctata, and that the cytotoxic effect of the 
recombinant protein is not obliterated by its fusion to EGFP. 

Example 9: Target site for cyplasin action 

The exact mechanisms behind the cytotoxic effects of cyplasin 
and recombinant cyplasin are not yet elaborated. However, it 
is unlikely that the cells take up a protein of this size with 
the consequence of exerting negative intracellular influence. 
Long-term observations of cyplasin-treated cells indicate that 
the first signs of cytotoxic action occur at the outer 
cellular membrane, at a time when the internal cell morphology 
shows no anomalies. This observation suggests that cyplasin- 
docking to the outer cellular membrane represents the trigger 
for a still unknown cascade of events that finally leads to 
cell death. This view is also in agreement with other 
observations. Mammalian cells transfected with expression 
constructs specifying cyplasin L or EGFP-tagged cyplasin L 
initially survive and they are able to produce the cytotoxic 
factor. However, they begin to exhibit the changed morphology 
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as soon as the cytotoxic factor becomes detectable in the 
spent medium. This suggests that extracellular cyplasin is 
cytotoxic while intracellular cyplasin is rather non-toxic. 
Finally f mammalian cells treated with the cyplasin-L-EGFP 
fusion protein extracted from stably transfected SF9 cells 
become surrounded by a faint halo of fluorescent fusion 
protein which is followed by the characteristic retraction and 
shrinking. 

Example 10: Absence of in vivo toxicity of cyplasin 

In order to test if cyplasin showed cytotoxic effects also in 
vivo, either genuine or recombinant cyplasin was injected into 
three groups of mice. Group 1 consisted of 12 DBA2 mice, which 
were injected with a high concentration of cyplasin into the 
tail vein. The concentration used exceeded by far the 
concentration found to be toxic in vitro. Nevertheless , all 
mice survived, at least up to four weeks. The same result was 
obtained when in a second group 12 DBA2 mice were injected 
subcutaneously under identical conditions. They survived and 
no negative effects were found during the observation period. 
Finally, a third group (6 mice) was injected into the tail 
vein using the recombinant cyplasin. Again all mice survived. 

Example 11 : Recombinant production of cyplasin without 
secretrory leader sequence in HeLa cells 

Cyplasin is cytotoxic for the host cells if secreted. Thus, it 
was investigated whether this problem can be overcome by using 
for recombinant production of cyplasin in human cells a DNA 
encoding cyplasin without secretory signal sequence. The amino 
acid sequence of cyplasin (Figure 2) was analysed using the 
"signal P program" [19-22] . Two potential cleavage sites for 
signal sequences were found in the N-terminal part of the 
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amino acid sequence which characterize cyplasin as a secreted 
protein (see Figure 10) . The putative cleavage site is between 
aa positions 19 and 20 (highest probability) or 52 and 53 
(lower probability). In order to ensure removal of the 
complete signal peptide, the DNA sequence encoding the signal 
peptide from aa position 1 to 52 was removed from the insert 
of the cyplasin encoding plasmid described in Example 1 (F,G), 
above. The modified DNA sequence comprising the EGFP-tag was 
cloned in appropriate sites of the pcDNA3-vector resulting in 
the corresponding cyplasin-L- (Sig-Seq. ) -EGFP expression 
construct. Then HeLa-cells were trans fected with this 
construct. In order to allow identification of transformed 
cells this construct was additionally capable of expressing 
the gene encoding the green fluorescent protein (GFP) as a 
marker. After selecting correctly transformed cells by use of 
the antibiotic G418-sulfate and after culturing the 
transfected cells for some weeks for selecting transformants 
with the vector stably integrated into their genome, a single 
cell cloning was carried out. As shown in Figure 12, the 
stably transfected cells express cyplasin. Cyplasin was 
isolated as described in Example 1 (J,K) and it could be shown 
by using the methods described in Example 1 (C,D) that it 
exhibits the same cytotoxic activity as the native cyplasin 
isolated from Aplysia punctata. 

A plasmid [pcDNA3-Cytoplasin.Mut- (-Sig. Seq. ) -EGFP] containing 
the coding sequence for cyplasin without a signal sequence has 
been deposited under Budapest Treaty with the DSMZ (Deutsche 
Sammlung von Mikroorganismen und Zellkulturen Gmbh, 
Mascheroder Weg 2, Braunschweig, Germany) on August 22, 2002 
under DSM 15153. 
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Claims 



1. An isolated nucleic acid molecule encoding the protein 
cyplasin with a deleted or non-functional secretory signal 
sequence or a protein exhibiting biological properties 
thereof, being selected from the group consisting of 

(a) a nucleic acid molecule encoding a protein comprising 
the amino acid sequence from position 20 or 53 to 
position 558 of Figure 2(a); 

(b) a nucleic acid molecule comprising the sequence of 
Figure 2 (b) ; 

(c) a nucleic acid molecule the nucleic acid sequence of 
which deviates from the nucleic sequences specified in 
(a) or (b) due to the degeneration of the genetic 
code ; and 

(d) a nucleic acid molecule, which represents a fragment, 
derivative or allelic variation of a nucleic acid 
sequence specified in (a) , (b) or (c) . 

2. A recombinant vector containing a nucleic acid molecule of 
claim 1 . 

3. The recombinant vector of claim 2 wherein the nucleic acid 
molecule is operatively linked to regulatory elements allowing 
transcription and synthesis of a translatable RNA in 
prokaryotic and/or eukaryotic host cells. 

4 . A recombinant host cell which contains the recombinant 
vector of claim 2 or 3. 

5. The recombinant host cell of claim 4, which is a mammalian 
cell, a bacterial cell, an insect cell or a yeast cell. 

6. An isolated protein exhibiting biological properties of 
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cyplasin encoded by a nucleic acid molecule of claim 1. 

7. A method of making a protein m exhibiting biological 
properties of cyplasin comprising: 

(a) culturing the recombinant host cell of claim 4 under 
conditions such that said protein is expressed; and 

(b) recovering said protein, 

8. A method of making a protein in eukaryotic host cells which 
is cytotoxic for said cells when secreted from said cells or 
externally applied comprising: 

(a) culturing a host cell * transf ected with a nucleic acid 
sequence encoding said protein with a deleted or non- 
functional secretory signal sequence under conditions such 
that said protein is expressed; and 

(b) recovering said protein. 

9. The method of claim 8 wherein the eukaryotic cells are 
mammalian cells. 

10. The protein produced by the method of claim 7 or 8. 

11. A pharmaceutical composition comprising a nucleic acid 
molecule of claim 1 or a protein of claim 6 or 10. 

12. Use of a nucleic acid molecule of 1 or a protein of claim 
6 or 10 for preparing a pharmaceutical composition for 
treating cancer. 
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Abstract 04. Sep, 2002 

Cytotoxic cyplasin of the sea hare, Aplysia punctata, cDUA 
cloning and expression of bioreactive recombinants 

Described is a nucleic acid coding for a protein called 
"cyplasin" that shows a preferential toxicity to autonomously 
growing mammalian cells. Cell death induced by this protein 
differs from both apoptosis and necrosis. An intracellular 
cell death which occurs when recombinantly preparing cyplasin 
in cell cultures can be avoided by removal of the secretion 
signal in the cyplasin sequence. This modification makes it 
possible to express the cyplasin in a mammalian cell culture 
which is preferable with regard to the glycosylation pattern 
of the obtained protein. Thus, the present invention also 
relates to a method of recombinantly producing a protein in 
eukaryotic cells, preferably mammalian cells, which is 
cytotoxic for said cells when applied externally. 
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GCC TAC CTT TTG AGG AAT AAA GGT CAG AAC 
ATC GGG GTC TTC GAA TTC TGT GAC AGA GTG 
GGT GGT CGG CTG TTC ACC TAT CAG TTG CCT 
AAT ACC CCC GAC GTG CAG CTG GAA CTG GGG 
GGG ATG CGG TAC ATC ACC GGC/GCT CAT AAC 
CTG CTC GAG GGA GTC GTT CGT CAG CTG GGA 
CTG ACC CCA GTA GTG TTT ACA GAA GGC TTC 
GGT AAG CTG GGC CGT ACA CGC TAT TAC CTG 
AGG GGA CAG TCC CTG ACC TTC CAG GAA GTG 
CTG ACA GGC GAC GTG CCA TAC AAC CTT ACC 
GTC GCG GAG AAG CAG AAC CAG GAC AAT ATT 
TTC GCC TTC TAT CTC AAG GAA CTA ACC CGT 
TTC GAC GTA GGC GAC GGT TTC GTG ACC AGA 
GAA CAA CTG CTG AAA CTG CGC GTC AGC GAT 
GGG AGG CTC CTC TAC CAA CTG ACG TTC GAC 
GAA GCC CTG GAC CTG GTA GCA TCG CCG GAA 
GGT AAA GAA TTT GCC AGG GAC ATT CAC GTG 
TTT ACG ACG GAG GTT TCA GAC GAC GCC AAC 
GCG GTT TCG GTG TTC GAC GAC CAC TTA GGT 
GAG GAC GGC GTA GGC GAG GAG ATC CAT ACC 
GTG CAA GAA GGA ATG CAG AAA GTA CCG GAG 
CAA CTG CTG CGT GCA TTT GGA AAC AGT TCC 
GTC TTC GGC CAC AGG GTC TTC ACT AAC CTG 
CAA CTG AAA GCA ATT CGA AGC AAA TCC GAC 
AAG AGC CAC GTC CTG TAC TTT AGG ACC ACC 
TCC ACG GTT GAC GGC AAA ACA ACA ATT CTC 
AAA TTC GAG CCG CTG CAG AAG GTC TGC ACG 
CGT CAG ATT ATC CTA GCT CTG CCT GTG TTC 
GCC CTC ATG CAG GTC GAT TGG CCT CCC CTG 
CGT GAG AAT CGG GCG CAG AAG GCG TAC GGC 
GCG GTC AGG ACC ATT CCA GCG AGC AAG GTC 
TTC ATG ACG TTC GAC CAA CCG TGG TGG CTT 
CAG AAC GAT GTG ACA GAC TTC CCA GCG TTT 
GTG ACC AAA GGA GAC ACC ACT TTC TCG CAA 
ATG TAC GAC TGG AAA AAG TCC GAG GCT TCT 
GGT GAC TAC ATC CTC ATC GCT TCG TAC GCC 
GAC GGC AAC AAT ACC CTC TTC CAG AGG GTG 
CTG CGC GAC CAA GGG GAG CCG ATC AAC GGC 
AGT GAA GCC GGC GCC CAC ATC GTG TCC GAG 
CCC CTT AAG AAC CAA ATT TTG GAC CAC CTC 
GCG GAC GCG TTT GGC GTC CCC CGT TCG GAC 
ATT CAG GAG CCC AAA ACG GCG GTC AGC AAG 
TTT TGG ACT GAC TAC CCG TTT GGG TGT GGA 
TGG ATT ACA TGG CGG GCC GGC TAC CAC TTC 
GAC GAT GTG ATG AAC ACC ATG CGC AGA CCC 
TCA CTC ACC GAC GAG GTC TAC GTT GTG GGT 
GCG GAC TAC TCT TGG GGC CTT ATT TCT TCC 
TGG GTG GAA GGC GCC CTG GAA ACC TCC TAC 
GAG GTA ATC GAT ACA TAC TTC AAA AGC GAG 
CGG TCA CAT AAT GTG CAA CCT CCA AGC CAC 
ATG GCC TCC CAC GTG GGC 
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AYLLRNKGQNIGVFEFCDRVGGRLFTYQLPNTPDVQLELGGMRYITGAHNLLEGWRQLG 
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LTPWFTEGFGKLGRTRYYLRGQSLTFQEVLTGDVPYNLTVAEKQNQDNIFAFYLKELTR 
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FDVGDGFVTREQLLKLRVSDGRLLYQLTFDEALDLVASPEGKEFARDIHVFTTEVSDDAN 
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AVSVFDDHLGEDGVGEEI HTVQEGMQKVPEQLLRAFGNS SVFGHRVFTNLQLKAI RS KS D 
190 200 • 210 220 230 240 

KSHVLYFRTTSTVDGKTTILKFEPLQKVCTRQIILALPVFALMQVDWPPLRENRAQKAYG 
250 260 270 280 290 300 

AVRT I PASKVFMT FDQPWWLQN DVT DFPAFVTKGDTT FSQMY DWKKS EASGDYIL I AS YA 
310 320 330 340 350 360 

DGNNTLFQRVLRDQGEPINGSEAGAHIVSEPLKNQILDHLADAFGVPRSDIQEPKTAVSK 
370 380 390 400 410 420 

FWTDYPFGCGWITWRAGYHFDDVMNTMRRPSLTDEVYWGADYSWGLISSWVEGALETSY 
430 440 450 460 470 480 

E V I DT Y FKS ERS HN VQ P PS HMAS HVG 
490 500 
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